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BINARY QUASARS AT HIGH REDSHIFT I: 24 NEW QUASAR PAIRS AT Z - 3 - 4 
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ABSTRACT 

The clustering of quasars on small scales yields fundamental constraints on models of quasar evo- 
lution and the buildup of supermassive black holes. This paper describes the first systematic survey 
to discover high redshift binary quasars. Using color-selection and photometric redshift techniques, 
we searched 8142 deg 2 of SDSS imaging data for binary quasar candidates, and confirmed them with 
follow-up spectroscopy. Our sample of 27 high redshift binaries (24 of them new discoveries) at red- 
shifts 2.9 < z < 4.3 with proper transverse separations 10 kpc < R± < 650 kpc increases the number 
of such objects known by an order of magnitude. Eight members of this sample are very close pairs 
with R± < 100 kpc, and of these close systems four are at z > 3.5. The completeness and efficiency of 
our well-defined selection algorithm are quantified using simulated photometry and we find that our 
sample is ~ 50% complete. Our companion paper uses this knowledge to make the first measurement 
of the small-scale clustering (R < 1 hr 1 Mpc comoving) of high-redshift quasars. High redshift bi- 
naries constitute exponentially rare coincidences of two extreme (M > 10 9 M Q ) supermassive black 
holes. At z ~ 4 there is about one close binary per 10 Gpc 3 , thus these could be the highest sigma 
peaks, the analogs of superclusters, in the early Universe. 

Subject headings: cosmology: observations - large-scale structure of universe - quasars: general - 
surveys 



1. INTRODUCTION 

Astronomers do not yet understand how quasars fit 
into the galaxy formation hierarchy and how they re- 
late to the underlying structure and evolution of the 
dark matter. In the current paradigm, every mas- 
sive galaxy is hosted by a dark matter halo and is 
thought to have undergone a luminous quasar phase; 
in this picture quasars at high redshift are the pro- 
genitors of the local dormant supermassive black holes 
found in the centers of all nearby bulge-dominated 
galaxies. This fundamental connection is supported 
by the tight correlations observed between the masses 
of central black holes and the velocity dispersions (or 
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masse s) of their old stellar popul a tions (iMagorrian et al 
P998t ~ 



et al.l 
2000; 



_ .Ferrarese fc Merrittl 120001: IGebhardt et al . 
Tremaine et al.l 120021: iHaring fc Rbd l2004h and by the 
fact that the black hole mass density in the local 
Universe is commensurate with the luminosit y den- 
sity produced by quasars at hi g h redshift (e.g. ISoltanl 
1982 _ Small & Blandford 1992: lYu fc Tremaine! 120021 



Haiman et all 12004 iMarconi et all 120041 : IShankar etldl 
20041 l2008aHShenll2009H . ~ 



Qu asars are likely to reside in massive hosts (|Turnerl 
1 1 9 9 lh and it has been suggested that they occupy 
the rarest peaks in the initial Gaussian d ensity 
fluctuation distribution (lEfstathiqu fe Reei 119881: 



Cole fc Kaiserl I fl989t iNusser fc Silld Il993t iDjorgovskil 
19991: iDjorgovski et all 119991 ). It is also thought that 
quasar activity is triggered by the frequent galaxy 



merger which are a generic consequence of bottom up 



19931: iBarnes & Hernauistl 119961: 


Bahcall et all 19971 


Thacker et al. 2006; Hopkins et al 


20081: iWetzel et all 


2009_b|JaJ), and merger-based models provide a good 



description of many of the obse rved aggregate proper- 
ties of the quasar p o pulatio n ( Haiman fc Loebl 119981: 
Cavaliere fc Vittorinil [20001: [Kauffm ann fc Haehnelfj 



2000r iWvithe fc Loebl [20021: iThacker et all l200f 
Hopkins e t alj|2006t Tha cker et al. 2008: lShenll2009D . 

In a hierarchical Universe, the large-scale cluster- 
ing of a population of objects is directly related to 
their host dark matter halo masses (|Cole fc KaiseH[l989l : 
IMo fc White! [19961) . Hence measuring quasar cluster- 
ing probes the relationship between quasar luminosity 
and halo mass, pro yiding constraints on the quasar life- 
time or duty cycle (ICole fc Kaiserlll989l E aiman fc Huil 
12001 iMartini fc Weinberg| l2001. but see Wyithe & Loeb 
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2008), physical parameters governing black hole fuel- 
ing such as the radiative e fficiency and Eddingt o n ra- 
tio (iWvithe fc Loebl l2005t IShankar et all 12008b!: IShen 



2009), and the scatter in these parameters (jWhite et al 
20081 ). The advent of large spectroscopic quasar sur- 
yeys has led to firm constraints on t h e clustering, 
(|Porciani et all l2004t ICroom et all 120051 : IMvers et all 
20061 12007at Ida Angela et al.ll2008l: iPadmanabhan et all 
2008b) : IShen et alJ (2008U iRoss et al.l I2009Q of quasars 
at z < 2.5 like luminous galaxies, quasars 
show little apparent clustering evolution with red- 
shift. In addition, the first esti mates of the depen- 
dence of clustering on lu minosity dPorciani &: Norberd 
[200l IShen et al.l l2008bf) . redshift ([Shen et alJ 120071) , 
the presence of br oad absorption line (BA L) troughs 
(IShen et al.1 [2008a), and radio brightness (|Shen et al.1 
2008b) have been made. The cross-correlation of 
quasars wit h galaxies has also resulted in co mplementary 
constraints jA dclbcrgcr & Stcidcl 2005b, al: ISerber et al.l 
120061: ICoil et al.l l2007: Pa dmanabhan et al.ll2008bf ). 

The clustering of high redshift quasa r s is much more 
challe nging to measure (|Kundid 119971 : IStephens et al.1 
119971) because they are int rinsically rare. B ut the Sloan 
Digital Sky Survey CSDSS: rYor¥^taIll2000h has recently 
enabled a measu rement of q uasar clustering on large 
scales at z > 3. IShen et all (|2007h measured the clus- 
tering of quasars from 2.9 < z < 5.4 using a sam- 
ple of > 4000 quasars selected from the SDSS DR5 
(|Adelman-McCarthy et all [2007). Dividing their sam- 
ple into two redshift bins, they measured comoving cor- 
relation lengths (7 = 2) of r = 16.9 ± 1.7 h' 1 Mpc 
and r = 24.3 ± 2.4 /i" 1 Mpc, for 2.9 < z < 3.5 
and z > 3.5, respectively. Thus z > 3 quasars clus- 
ter much more strongly than their z < 2.5 counter- 
parts (r ~ 7 h~ x Mpc) , implying that high-redshift 
quasars are highly biased and inhabit very massive (M > 
10 13 hr 1 M Q ) dark matter halos. These large scale clus- 
tering constraints at low and high redshift now play a 
fundamental role in our understanding of the quasar 
evolution and the buil dup of sup ermassive black holes 
(IThacker et al.1 120061: i Lidz et all 120061: iHopkins et all 
2007at IThacker et al I 120081: iDiorgovski et al.l I200E' 



Hopkins et all 120081: IWhite et al.1 120081 : IShankar et al l 
2008b nShCTll2009h . 

Quasar clustering on small scales (R < 200 kpc proper) 
yields independe nt and complemen tary constraints on 
quasar evolution. lDjorgovs"kH (|1991l ) first suggested that 
the handful of binary quasars known at the time im- 
plied a factor of ~ 100 larger clustering amplitude on 
scales R < 50 kpc than predicted by naively extrapo- 
lating the quasar correlation function power law from 
larger scales, and proposed that this was due to the 
enhancement of quasa r activity during merger events. 
iHennawi et al.l (|2006bf ) conducted an extensive follow-up 
campaign to find companions of SDSS quasars, provid- 
ing the first measurement of the quasar correlation func- 
tion on scales 10 h kpc < i? pr0 p < 400 hr 1 kpc. For 
-Rprop 50 hkpc, they detected an order of magnitude 
higher clustering than the expectation from the larger- 
scale quasar correlation function extrapolated as a power 
law, providing compelling evidence that the quasar cor- 
relation function steepens on sub-Mpc scales. This re- 
sult has since been confirmed with better accuracy using 



more homogeneous selected photo metric (IMvers et all 
l2007bh and spectroscopic samples (jMvers et al.1 12008). 
albeit with less excess over a power law. Additional cir- 
cumstantial evidence for enhanced small scale cluster- 
ing com es from the recent dis covery of the first triple 
quasar (jDjorgovski et al] 120071 ) . Similarly large small- 
scale clusteri ng may be present around low-redshift A GN 
at z < 0.6 (jSerber et al.l 120061: IStTand et al.ll2008l but 
see Padmanabhan et al. 2008). Motivated by these 
small scale measurements, several theoretical investiga- 
tions have argued that excess small scale clustering oc- 
curs naturally i f qua s ar activity is trigger e d by mergers 
(IThacker et al.l 120061: flopkins et all 120081: IWetzel et all 
I2009bt IThacker et alJ 120081) and have considered the 
constraints that small scale clustering measurements 
lace on how quasars populate their dar k matter halos 
Wetzel et aDl2009bl: IShankar et al.lf2009h . 

Measuring small scale quasar clustering is of partic- 
ular interest at high redshifts (z > 3), where the pop- 
ulation of supermassive black holes powering quasars 
is rapidly growing and the large scale clustering indi- 
cates that quasars are hosted by extremely massive dark 
matter halos (M > 10 13 h^ 1 M ), which are the pro- 
genitors of today's rich clusters. Could the mechanism 
which triggers the growth of these extreme black holes 
(M > 10 9 M at high redshift) be different from that 
at lower redshifts? Are z ~ 4 quasars always at the 
center of their host dark matter halo, or can they be 
hosted by less massive satellites? If a high redshift bi- 
nary quasar represents an exponentially rare coincidence 
of two extremely massive black holes (and dark matter 
halos/subhalos), are these the highest sigma peaks, the 
analogs of superclusters, in the early Universe? The evo- 
lution of small scale clustering to high redshift can shed 
light on all of these questions. Ho wever, even in the large 
z > 3 quasar sample studied by IShen et ail (|2007[ ) , the 
smallest scale at which the correlation function can be 
measured is ~ 2 hr 1 Mpc. The reason for this is twofold. 
First, close quasar pairs with angular separations < 60" 
(~ 1.5 h^ 1 Mpc at z ~ 4), are extremely rare — the 
mean quasar separation at z > 3 is ~ 150 h^ 1 Mpc and 
at small separations, the correlation function does not 
increase as fast as the volume decreases. Second, be- 
cause of the finite size of the optical fibers of the SDSS 
multi-object spectrograph, only one quasar in a close pair 
with separation < 55 " can be observed on a given plate 13 
(Blan ton et al.ll2~003f) . The only sub-arcminute high red- 
shift quasar pair in the literature is a 33 " pair of quasars 
at z = 4 25 discovered serendipit ously bv lSchneider et all 
( 2000). IDiorgovski et al.1 (|2003l ) discovered a quasar at 
z = 5.02 196" away f rom the h i gh red shift quasar at 
z = 4.96 discovered bv lFan et al.1 (|1999h . corresponding 
to a proper transverse separation of 900 h^ 1 kpc. This 
is the highest redshift pair of quasars known. 

In this paper, we present the results of a systematic 
survey to discover high redshift binary quasars and mea- 
sure the small-scale clustering of quasars at z ~ 3 — 4 for 
the first time. Using color-selection and photometric red- 
shift techniques, we searched the SDSS photometric data 

13 An exception to this rule exists for a fraction (~ 30%) of 
the area of the SDSS spectroscopic survey covered by overlapping 
plates. Because the same area of sky was observed spectroscopi- 
cally on more than one occasion, there is no fiber collision limitation 
for these regions. 
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for binary quasar candidates, which we confirmed spec- 
troscopically with follow-up observations. The outline of 
this paper is as follows. In §[51 we present our algorithm 
for finding high redshift binary quasars candidates in the 
SDSS photometric data, and quantify the completeness 
and efficiency of our search. Our spectroscopic follow-up 
observations are described in § and the binary quasar 
sample is presented in § [4[ where we also summarize the 
status of our survey. We summarize and conclude in § [5j 
In a companion paper (jShen et al.ll2009f) (henceforth Pa- 
per II), we use the binaries presented here to measure 
the small-scale (R < 1 h~ x Mpc comoving) clustering of 
high redshift quasars. 

Throughout this paper we use the best fit Wilkin- 
son Microwave Aniso tropy Probe cosmological model of 
iDunklev et all (pOOl ). with Q m = 0.26, Q A = 0.74, 
h = 0.7. Because both proper and comoving distances 
are used in this paper, we will always explicitly indicate 
proper distances (quoted in kpc) . It is helpful to remem- 
ber that in the chosen cosmology, for a typical quasar red- 
shift of z = 3.5, an angular separation of Ad = 1" corre- 
sponds to a proper transverse separation of R± = 7.6 kpc 
(comoving R± = 24.1 h^ 1 kpc), and a velocity difference 
of 1000 km s~ at this redshift corresponds to a proper 
radial redshift space distance of S|| = 2.9 Mpc (comoving 
S|| = 9.1 h~ x Mpc). All magnitudes quoted are SDSS as- 
inh magnitudes (jLupton et al.lll999| ). To avoid confusion 
between redshift and z-band magnitude, we will always 
use a text "z" to denote redshift. 

2. FINDING BINARY QUASARS 

Because binary quasars at z > 3 are extremely rare, 
we must target candidates to i = 21, as faint as the 
SDSS imaging data will allow, to build up statistics. This 
flux limit is significantly fainter than the SDSS spectro- 
scopic flux limit (i < 20.2), and because fiber collisions 
exclude close pairs in SDSS spectroscopy, we must select 
binary candidates from the imaging data and then follow 
them up spectroscopically. Finding binary quasars thus 
amounts to constructing a photometric quasar catalog 
with a high completeness and efficiency at faint magni- 
tudes. 

Our algorithm for selecting z > 3 binary quasars 
evolved somewhat over the course of our survey. We ex- 
perimented with different approaches, such as searching 
for companions o nly around spec t roscopi cally confirmed 
quasars following iHennawi et al.1 (|2006bD , searching for 
pairs i n the phot ometric quasar catalog of lRichards et al.1 
((20091) ( see also iRichards et aill2004l ). and constructing 
our own photometric quasar catalog. The final algorithm 
adopted is an amalgam of these methods and we describe 
it in detail in what follows. In § 12. H we provide de- 
tails of the SDSS spectroscopic quasar sample. Many of 
our binary candidates have one member bright enough 
for the spectroscopic survey, and these data are used 
whenever possible. T he photometric quasar catalog of 
IRichards et"aU (|2009t ) is described in § O This catalog 
did not perform well enough at high redshift to be our 
sole selection algorithm, however we utilize it in an aux- 
iliary manner to improve the efficiency of our search. An 
important step in understanding the completeness of our 
binary selection was the construction of simulated quasar 
data, as we describe in S 12.31 The core of our selection 



algorithm is the construction of a photometric quasar 
catalog which carefully treats photometric errors, which 
we introduce in § !2.4l and apply to the SDSS imaging data 
in § 12.51 The final algorithm for selecting binaries is pre- 
sented in § I2.6( where we also estimate the completeness 
of our search - a necessary step for the small scale quasar 
clustering constra ints presented in our companion paper 
(iShen et al.ll2009fl . 

2.1. The SDSS Spectroscopic Quasar Sample 

The Slo an Digital Sky Su rvey uses a dedicated 2.5m 
telescop e dGunn et al.l |2006[ ) and a large format CCD 
camera (|Gunn et al. I1998T ) at the Apache Point Obser- 
vatory in New Mexico to obtain images in five broad 
bands (u, <?, r, i and z, centered at 3551, 4686, 
6166 7480 and 8932 A, r espectively; iFukugita et~ai1 
119961: IStoughton et all l2002f ) of high Galactic latitude 
sky in the Northern Galactic Cap. The im aging data 
are p rocessed by the astrometric pipeline ( | Pier et al.l 
120031) and photometric pipeline (|Lupton et al.l 12001). 
and are phot ometrically c alibra t ed to a standard 
star network (ISmith et a I l2002t iTucker et all 120061 : 
iPadmanabhan et al.l 12008a ). Additional details on the 
SDSS data products can be found in the da ta release 
papers (e.g. lAbazaiian et"aLl[2003l . I21MI2005T ). 

Quasar candidates are targeted for follow-up spec- 
troscopy based on colors measu red from the SDSS imag- 
ing data (Ric hards et al.ll2002af ). These candidates along 
with other targets (i.e. galaxies, stars, serendipity), 
are observed with two double spectrographs produc- 
ing spectra covering 3800-9200 A with a spectral res- 
olution ranging from 1800 to 2100. Det ails of the 
spectr os copic observation s can be fou nd in lYork et al 



2000), ICastander et all ((2001), and IStoughton etal 



2002) . Quasars observed through th e Fifth Data Re 



lease (lAdelman- McCarthy et ak| |2007f) have been cata- 
logued by ISchneider et al.l ((20071 see also Schneider et 
al. 2005). 

High-redshift quasars are optically unresolved and thus 
cannot be distinguished from stars based on image mor- 
phology. The majority of quasar candidates are selected 
based on their location in the multidimensional SDSS 
color-space. All magnitude s are reddening correc ted fol- 
lowing the prescription in ISchlegel et all (fl998h . Ob- 
jects with colors that place them far from the stellar 
locus and which do not inhabit specific 'exclusion' re- 
gions (e.g., places dominated by white dwarf's, A stars, 
and M star-white dwarf pairs (|Smolcic et al.ll200l ) are 
identified as primary quasar candidates. An i magnitude 
limit of 19.1 is imposed for candidates whose colors in- 
dicate a probable redshift of less than ss 3, while higher 
redshift candidates are targeted if i < 20.2. The colors 
of quasars become difficult to distinguish f rom F-stars 
at z w 2.7 (|Fadll999t (Richards et al.ll2001af) . which sig- 
nificantly lowers the efficiency of quasar target selection. 
The SDSS quasar target selection therefore deliberately 
sparse samples in tha t region of color space, result ing in 
high incompleteness ((Richards et al.1 l2002al l2006h near 
this redshift. We therefore focus our high redshift binary 
quasar search at z > 2.9. 

2.2. The SDSS KDE Photometric Sample 

IRichards et~afl (|2004l) and IRichards et alj (|2009t ) have 
constructed faint (i < 21) photometric samples of 
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quasars from the SDSS photometry alone, by separat- 
ing quasars from stars using knowledge of their relative 
densities in color space. A technique known as Ker- 
nel Density Estimation (henceforth KDE, or 'the KDE 
technique') uses training sets of quasars and stars to ob- 
tain non-parametric estimates of their respective densi- 
ties in color space. Then a probability can be assigned for 
any photometric object of interest for being quasar-like, 
or star-like. Objects with quasar-like probability above 
a threshold are selected as photometric quasars, and 
are assigned a photometric redshift, a nd a probability 
that the photometric reds hift is correct jW einstei n et all 
120041 ). The most recent iRichards et~atl (|2009f ) photo- 
metric quasar catalog of (henceforth the DR6 KDE cat- 
alog) covers the 8417 deg 2 area corresponding to the 
SPSS Data Release 6 (DR6; lAdelman-McCarthv et~aTl 
2008) , extends down to an extinction corrected flux limit 
of i < 21.3, and attempts to classify quasars over the 
full range of redshifts accessible to the SDSS imaging, 
< z < 5.5. 

The fundamental drawback of the KDE approach is 
that the probabilities of an object being quasar-like or 
star-like do not take into account photometric errors. At 
faint magnitudes where errors become large, the KDE 
catalog suffers from incompleteness which is not cur- 
rently understood. Since a quantitative estimate of the 
completeness of our selection is necessary to place con- 
straints on small-scale quasar clustering, using the KDE 
catalog alone as our selection algorithm is insufficient. 
Of the 27 z > 2.9 binary quasars in the SDSS footprint 
(see Table[3]), 25 have both members bright enough to 
be in the KDE catalog. Of these, 17 have both members 
in the KDE catalog and the other eight have only one 
member in the KDE. 

2.3. Simulated Binary Quasars 

Our binary survey extends to i — 21, and because of 
photometric errors, the completeness decreases to fainter 
magnitudes. Hence we need to generated simulated bi- 
nary quasars to determine our completeness, as we do 
not have a large sample of spectroscopically confirmed 
z > 2.9 quasars at magnitudes fainter than the SDSS 
quasar survey flux limit of i = 20.2. 

It will prove to be convenient if the simulated quasars 
have an approximately correct redshift and number count 
distribution. Even at bright magnitudes, the redshift 
and magnitude distribution of the SDSS quasar sam- 
ple has the selection function of the targeting algorithm 
imprinted on it. In particular, the SDSS spectroscopic 
sample suffers high incomple teness at z = 2.7 — 3 .0 and 
at z ~ 3.4 (see Figure 8 of IRichards et all [20061. In- 
stead, we determine the redshift and number count dis- 
tribution from the measured quasar luminosity function 
(LF). Usable luminosity functions for quasars at redshift 
2.9 < z < 4.5 whic h probe th e faint a pparent magnitude s 
(i < 21) include IWolf et~ap (12003D. Ijiang et all (120061). 
IRichards etldl (|2006l ). and lHopkins et alj (|2007bh . After 
comparing all of these r e sults, we decided on a combina- 
tion of the I Jiang et alJ (120061) LF fits (at z < 3.5) and 
the COMBO-17 PDE (|Wolf et"alll200l fits (at z > 3.5) 
for the model LF, and scaled the result to our standard 
cosmology. We re fer the reader to our companion paper 
(|Shen et al.ll2009fl for additional details of the adopted 
LF. 



Our procedure for simulating binary quasars is then as 
follows: 

- Draw a quasar redshift z from the redshift distribu- 
tion dN/dz(i < 21) determined by integrating the 
LF. 

- Given this redshift, draw two magnitudes, i\ and 
«2, for the two members of the binary from the 
apparent magnitude distribution of quasars at this 
redshift, rt(z, < i). 

- If ii < 20.2, locate a quasar in the real SDSS quasar 
sample with a similar value of i and z. Use the real 
quasar photometry for the first quasar. 

- If i\ > 20.2, then locate a quasar in the real SDSS 
quasar sample with well measured colors (i < 19.5) 
and similar z. Make the real quasar fainter by 
scaling all five fluxes / m , m = (u, g,r,i, z) , by 
the implied ratio of i-band fluxes. Add Gaussian 
noise to the scaled fake data with standard devia- 
tion <J m (f m ) consistent with the SDSS noise model 
(see below). 

- Use same procedure to generate photometry for the 
second member of the binary. 

Note that because the aforementioned procedure is 
based on real quasar photometry at a given redshift, the 
effect of Lya forest and Lyman-limit system absorption 
on quasar colors are automatically modeled correctly. 

To model the noise in the photometry as a function 
of flux level in filter m, cr m (f m ), we randomly select 
10 6 stellar objects from the SDSS photometric data and 
apply a running median filter. Given a five-dimensional 
(ugriz) flux vector f m for a simulated quasar, we 
then add random Gaussian noise. Note that our pro- 
cedure assumes no covariance between the flux noise 
in different filt ers, which is a good approximation for 
point sources ( Scrant on et al.l [2005). As an example, 
a randomly chosen simulated quasar at z = 3.2 with 
i = 20.2 has to = (23.00,20.66,20.32,20.20,20.00) 
and a = (0.47,0.025,0.026,0.034,0.082), where 
these values represent real photometric data (since 
i < 20.2). Similarly at z = 4 we simulate an 
object with to = (24.68,21.58,20.28,20.20,20.21) 
and (0.87,0.058,0.027,0.033,0.12). For quasars 
at the same redshifts but with i = 21, again 
chosen at random, we have example objects 
with to = (23.27,21.42,20.91,21.00,20.83) and 
a = (0.66,0.053,0.047,0.072,0.21) at z = 3.2, 
and (25.92,22.83,21.12,21.00,21.10) and a = 
(0.70, 0.17, 0.055, 0.072, 0.26) at z = 4. 

2.4. Locus Distances 

2.4.1. The Quasar Locus 

Although quasars have a wide range of luminosi- 
ties, the majority of unobscured quasars have sim- 
ilar optical /ul t raviole t spectral energy distributions. 
IRichards et al.l (|2001a| ) demonstrated that most quasars 
follow a tight color-redshift relation in the SDSS filter 
system, a property which has bee n exploited to calculate 
photometric redshifts of quasars (IRichards et al.ll2001bl ; 
iBudavari et"ail 120011 : IWeinstein et al.ll2004h . It is thus 
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possible to efficiently target binary quasars by search- 
ing for pairs of objects with similar quasar- like colors 
(|Hennawill200l: iHennawi et al.ll2006bft . But stars are a 
significant contaminant and must be avoided: for i < 19 
stars outnumber quasars on the sky b y a factor of > 50 
(|Yasuda et alJl2001t ISesar et al.ll2007l e.g.) even at high 
galactic latitudes, although stars are less dominant at 
fainter magnitudes (factor ~ 25 for i < 21). The vast 
majority of stars detected by SPSS are on the main se- 
quence (> 98%; iFinlator et all [20051 iHelmi et alj|2003t 
Uuric et aT1l2008l ) and form an extremely tigh t well de- 
fined temperature sequence (jlvezic et al.ll2004l ). referred 
to as the stellar locus in color-color diagrams. Hence 
quasars can be selected efficiently by targeting unre- 
solved objects near the quasar locus but far from the 
stellar locus, exce pt at redshift z » 2.7 and z = 3.4 wher e 
the two loci cross ilF^n1[l99llRichards et alJl2001aL[2006h . 

As objects become progressively fainter, photomet- 
ric measurement errors blur the distinction between the 
quasar and stellar loci. Th e SPSS quasar spectr oscopic 
target selection algorithm (|Richards et al.ll2002af) is not 
well-suited for such faint objects (i ~ 21), and as dis- 
cussed in § 12.21 the PR6 KPE catalog is not well charac- 
terized at these faint magnitudes either. Hence, we intro- 
duce a new algorithm for selecting high redshift quasars 
focusing on a proper statistical treatment o f photomet- 
ric errors. Following IHennawi et al.l (2006b), we define 
a statistic that quantifies the likelihood that an astro- 
nomical object has colors consistent with a given model, 
based on the mean quasar color-redshift relation and the 
stellar locus. 

Given the fluxes of an object in the five SPSS bands 
/data» we as k whether they are consistent with propor- 
tionality to a model 

/data = A /model J CO 

where f™ ode i is a five-dimensional vector of model fluxes, 
and this relationship holds for a single proportionality 
constant in all bands. For the quasar locus, the model 
will be the average run of scaled quasar flux with redshift 
/model ( z )> whereas for the stellar locus the model will be 
the scaled stellar flux as a function of surface tempera- 
ture. 

For a quasar, the maximum likelihood value of the pa- 
rameters A and z, given measured fluxes /^ ta , can be 
determined by x 2 minimization: 

2/4 \ _ [/data ~ ^/modcl( Z )] 2 (n\ 

where a™ ata are the photometric measurement errors, 
and c m 'odei( z ) ^ s the intrinsic ler scatter about the mean 
quasar flux-redshift relation, both presumed to be Gaus- 
sian distributed. The assumption of Gaussian statistics i s 
valid for the measurement errors (|Scranton et al.ll2005T) . 
but iRichards et al.l (|2001af) found a red tail in the dis- 
tribution of quasar colors about the mean, which is due 
to du st extinction (jRichards et al.l 12003: Ho pkins et al.l 
l200l . 

To compute /™ odcl (z) we follow IRichards et alj (|2001aft 
to fit the mean color-redshift relation for quasars, but in- 
stead of fitting colors we rescale all the f m to have the 
same value of f r + f* + f z (and ex™ 1 rescaled accordingly) , 
and fit /model ( z )- This procedure is preferable to fitting 



colors for two reasons. First, for high-redshift quasars 
that have dropped out of a filter, say M-band, forming the 
u — g color degrades a high signal-to-noise ratio g-band 
measurement with the noisy u band measurement. Sec- 
ond, at low signal-to-no ise ratios Pogson m agnitudes or 
SPSS asinh magnitudes (jLupton et al.lll999[ ) pose several 
disadvantages, such as bias in the resulting colors and 
non-Gaussian error distributions, whereas the rescaled 
fluxes are immune from these problems. 

To fit /,™ odol (^), we start with a sample of ~ 
77,000 quasars in t he SPSS PR5 quasar catalog 
([Schneider et al.ll2007h . Restricting attention to quasars 
in the redshift range 0.3 < z < 5.5, removing broad- 
absorption line quasars (which would bias the mean col- 
ors), and restricting attention to quasars with well mea- 
sured redshifts (see Shen et al. 2008) and the most accu- 
rate photometry, results in a sample of 52,000 quasars, 
including 5143 quasars with z > 2.9 and 829 with z > 4. 
Fluxes are g alactic extinction-cor rected following the 
prescription in Schl egel et al.l ([19981) and rescaled to have 
the same value of f + p + f z . The rescaled flux in 
each band is then fit as a function of redshift z using 
a 4th order B-spline with a breakpoint spacing of 0.1 
and outlier rejection. Given this mean flux-redshift re- 
lation /™ ode i (z), we then determine the la scatter about 
it, f m l dei( z ); by applying a running median filter to the 
difference between the data and the fits, f m — /^odei ( z )i 
sorted by redshift. By construction this median curve 
brackets 50% of the data points and varies smoothly with 
redshift. The la scatter is then determined from the me- 
dian curve by rescaling the median by the appropriate 
factor to enclose 68% of the data points for a Gaussian 
distribution. Our estimate of cr™ odel (z) is clearly approx- 
imate since it assumes Gaussian statistics, and includes 
a significant contribution from photometric errors from 
the fainter objects. 

If the redshift of the quasar is known, minimizing x 2 
with respect to A results in four degrees of freedom (the 
number of independent colors that can be formed from 
five fluxes) and x 2 should follow the xl distribution. If 
instead z is unknown, it can be estimated by minimizing 
X 2 - However, the resulting distribution of x 2 values will 
be close to x 2 i but not exactly because the colors are not 
a linear function of z 14 . 

The histograms in the left panel of Figure [1] show the 
resulting distribution of x 2 ( z ) when the /modci( z ) ^ s eval- 
uated at the true redshift of the quasar. The solid (black) 
histogram is the distribution of x 2 for the subset of the 
training sample used to fit the quasar flux-redshift rela- 
tion, which have i < 20.2 and z > 2.5 (6696 quasars). 
The dashed (blue) histogram is the distribution of x 2 
for a sample of ~ 600, 000 fainter simulated quasars with 
i < 21 (sec § 12. 3p . The solid (red) curve shows the x 2 dis- 
tribution. The median value of x 2 for the real data is 3.93 
and it is 3.63 for the simulated data, whereas the median 
of the x 2 distribution is 3.37. Our training set (solid his- 
togram) is broader than the x\ distribution (solid curve) 
and exhibits a tail toward large values, presumably be- 
cause o f the non-Gaussian tai l of the quasar color distri- 
bution (|Richards et al.ll2001af) . For the fainter simulated 

14 Recall that minimizing x 2 with respect to a parameter reduces 
the number of degrees of freedom by one only if \ 2 is linear in that 
parameter. 
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sample, the scatter about the mean becomes more dom- 
inated by photometric measurement errors rather than 
intrinsic color variations, and the distribution is closer 
to xl 

The distribution of x 2 minimized with respect to both 
parameters, A and z, is shown in the right panel of 
Figure [1] This number quantifies the likelihood that a 
quasar has photometric redshift z, and we denote it as 
Xphot- A s expected, the resulting distributions of Xphot 
are significantly narrower than before, with median val- 
ues of 2.19 and 2.17 for the training sample and fainter 
simulations, respectively, which can be compared to the 
median of of the x 2 distribution, 2.38. The performance 
of our photometric redshift algorithm for high redshift 
quasars is illustrated in Figure [2j where we plot photo- 
metric redshift versus spectroscopic redshift for quasars 
in the training sample with 2.5 < z < 4.5. Our pho- 
tometric redshifts are very accurate for high redshift 
quasars, except and z < 2.7 where we tend to under- 
estimate the true redshift. This degeneracy with lower 
photometric redshifts is likely due to poor signal-to-noise 
ratio in the w-band. The primary lever arm for z > 2 
photometric redshifts comes from filters bracketing the 
Lya forest spectral break, which falls in u for z < 2.7. 
For the quasars with 2.9 < z < 4.5 of interest for the 
binary selection in this paper, we find that 83% of the 
real training set quasars and 81% of the fainter simulated 
quasars have \z — z p hot| < 0.2. Computing the standard 
deviation with outlier rejection, we find that a z = 0.14 
for the training set data and o~ z = 0.15 for the fainter 
simulated data. 

2.4.2. The Stellar Locus 

We use a procedure similar to that used for the quasar 
locus to define the stellar locus. First we isolate a set of 
representative stars with accurate photometry using a set 
of SDSS spectroscopic plates on which all point sources 
were targ eted above a flux limit of i < 19 .1 regardless of 
color ( see lAdelman-McCarthv et al.ll2006h . After apply- 
ing cuts to remove objects with noisy photometry and 
stars at the extremes of the color distribution, we ar- 
rive at a set of 13,837 spectroscopically confirmed stars 
with precise SDSS photometry. We parametrize the path 
these stars follow through the five-dimensional SDSS fil- 
ter space using the g — i color, a decent proxy for stel- 
lar temperature. Our fit for /™ odc i(<? — i) uses the same 
methodology as we did for the quasar locus: we rescale 
all stars to have the same value of f r + f % + f z , and each 
band f m is fit with a B- spline as a function of g — i. 
The error c™ 0( j c j((7 — i) is more subtle than for the stel- 
lar locus. The intri nsic width of the stellar locus is so 
small (~ 0.02 mag llvezic et al.l 120041 ) that the scatter 
in our star data about the mean relation is completely 
dominated by photometric measurement errors, even at 
i < 19.1. Hence we simply use o-™ odel (g — i) as a floor on 
the error to prevent extreme values of % 2 for rare cases 
of extremely high signal-to-noise ratio photometry. 

The histogram in the left panel of Figure [3] shows the 
distribution of x 2 from the sample of stars which was 
used to fit the stellar locus, where f^ odc i(g — i) was eval- 
uated at the observed value of the g — i for each star. 
The (red) curve shows the xl distribution. The distribu- 
tion of the data differs significantly from x\- the median 
value of the x 2 f° r the training set is 4.35 compared to 



a median of 3.37 for the x\ distribution. This difference 
is due to a tail of stars with atypical colors, probably 
arising from unusual metallicities or surface gravities. In 
addition, intrinsic stellar colors depend on apparent mag- 
nitude, as the properties of stars change with distance, 
whereas our fits ignore any such dependence. 

The minimum distance to the stellar locus, which 
we designate Xstai-j can be computed by minimizing 
X 2 (A,g — i) with respect to both parameters. The sta- 
tistical significance of an outlier from the stellar locus is 
quantified by Xstar • The right panel of Figure [3] shows 
the distribution of Xstar compared to the x§ distribution. 
Requiring x 2 tar be larger than a specified value is an effec- 
tive means of reducing stellar contamination from quasar 
selection. 

To visualize how quasars and stars objects move 
through the four-dimensional SDSS color space (u — g, 
g — r,r — i,i — z), we plot our fits to the quasar and stellar 
loci in the three color-color diagrams in Figure |4l 

2.5. Faint Photometric Quasar Selection 

We construct a photometric quasar catalog targeting 
objects consistent with the quasar locus, that is small 
Xphof ana - inconsistent with the stellar locus, hence 
large Xstar- Of our simulated i < 21 quasars with 
2.9 < z < 4.5, 94% have xphot < 10 ( see Figured]), 
hence we can achieve a sample with high completeness 
by imposing this cut on quasar locus distance. To de- 
termine the placement of the cut on stellar locus dis- 
tance consider the left panel of Figure [5] The histograms 
show the completeness as a function of redshift for vary- 
ing limits on Xstar as determined from our ~ 600, 000 
simulated quasars with i < 21 and z > 2.9. Specifi- 
cally, from top to bottom the five histograms represent 
Xstar ^ [0j 5, 10, 15, 40]. In addition to the cut on Xstar we 
have imposed Xphot < 10 and several other auxiliary cuts 
(described in detail below) which have less of an impact 
on the overall completeness than Xphot ■ 

If there is no constraint on Xstar (upper black curves 
in Figure [5|) the other cuts result in a total completeness 
of 82% in the redshift interval 2.9 < z < 4.5 Increas- 
ing the cut on Xstar forces quasars to be higher signifi- 
cance outliers from the stellar locus, thus decreasing the 
completeness. The decrease is particularly strong near 
z ~ 3.4, because the stellar and quasar loci nearly over- 
lap at this redshift (see Figure [4j. Actually, the loci 
are still distinct in the ugr color-color diagram, but for 
fainter objects photometric errors blur this distinction 
because of low signal-to-noise ratio in the u-band. We 
choose Xstar > 10 (blue curves in Figure [5]) as a compro- 
mise between completeness and the resulting efficiency 
of our search, which is quantified below. 

Our binary quasar selection algorithm contains several 
steps. The area we consider for our search is the entire 
SDSS DR6 with an added constraint on galactic latitude 
|6| > 25° to avoid contamination from highly reddened 
sources, resulting in a final survey area of 8142 deg 2 . 
In this area we first consider all objects that fulfill the 
set of conditions below to be photometric quasar candi- 
dates. In what follows we describe and motivate each 
criterion and parenthetically quote the resulting com- 
pleteness, determined from our simulations (i < 21 and 
2.9 < z < 4.5), if each criterion is applied sequentially: 
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1. — Histograms of quasar locus distance statistics. The left panel shows the distribution of x 2 ( z )i where redshift was taken to 
be known and eqn. [2] was minimized for A only. The right panel shows the distribution of Xphot' ' ,e ' mm i m i ze d with respect to both 
amplitude A and redshift z. The solid (black) histograms are distributions for the 6696 i < 20.2 training sample quasars at z > 2.5 used 
to fit the fiux-redshift relation. The dashed (blue) histograms are from ~ 600, 000 fainter (i < 21) simulated quasars at z > 2.5. The solid 
(red) curve shows the x\ distribution in the left panel and the x| in the left panel. 

objects (> 99%) 

E) a photometric redshift dependent magnitude prior 
is imposed discarding stars which are stellar locus 
outliers but too bright to be quasars (99%) 

F) exclude objects with colors consistent with be- 
ing white dwarf + M -star pairs (see Figure |H 
iRichards et alj l2001al) . a significant contaminant 
(99%) 




-spec 



Fig. 2. — Photometric redshift versus spectroscopic redshift for 
quasars with 2.5 < z < 4.5 in our training sample (i < 20.2). For 
z > 2.9, 83% of the training set quasars and 81% of the fainter 
(i < 21) simulated quasars have |z — z p i lot | < 0.2. The standard 
deviation (with outlier rejection) is <r z = 0.14 for the training set 
and <r z = 0.15 for the fainter simulated data. 

A) the image must be classified as STAR by the pho- 
tometric pipeline (> 99%) 

B) various cuts on flags produced by the SDSS pho- 
tometric pipeline, similar to those used by spec- 
trosco pic quasar target selection (|Richards et al.l 
l2002aD . must be satisfied (> 99%) 

C) The z-band photometric error must satisfy o\ < 0.2, 
excluding noisy data or spurious deblends (> 99%) 

D) the galactic extinction in the i-band must be A\ < 
0.3 reducing contamination from highly reddened 



G) the apparent magnitude must satisfy i < 
(96%) 15 



21 



H) the g — r color must be redward of the stellar locus 
in g—r vs r—i color-color diagram (see Figure 3]) for 
3.5 < Zphot < 4.5, significantly reducing contami- 
nation from faint red stars in this redshift range 
(92%) 

I) colors must be consistent with the quasar locus and 



have 



Xpho 



< 10 



J) the photometric redshift must be in the range 
2.85 < Zphot, thus excluding redshifts where selec- 
tion is less efficient (see below, 82%) 

K) colors must be inconsistent with the stellar locus 
and have xLr > 10 (64%) 

Thus our final completeness for selecting individual 
quasars with i <21 and 2.9 < z < 4.5 is 64%, with a red- 
shift dependence given by the blue line in the left hand 
panel of Figure[5l Our selection is much more efficient for 
brighter higher signal-to-noise ratio sources since stellar 
locus outliers can be identified at higher statistical signif- 
icance. The right panel of Figure [5] shows completeness 
as a function of redshift for varying limits on Xstar; but 
for brighter quasars with i < 20.2 satisfying the flux limit 

15 This reduction in completeness is due to Malmquist bias, given 
photometric errors. 
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Fig. 3. — Histograms of stellar locus distance statistics for 13,837 spectroscopically confirmed stars with i < 19.1. The left panel shows 
the distribution of x 2 (fl — *) if eqn. [2]is minimized for amplitude A only. The right panel shows the distribution of the minimum distance 
to the stellar locus, whereby \ 2 {A, g — i) is minimized with respect to both parameters. The (red) curves shows the \\ an d X§ distributions 
in the left and right panels, respectively. 



for the SDSS spectroscopic sample. The resulting com- 
pleteness for i < 20.2 quasars is 83%, compared to 64% 
for the fainter i < 21 flux limit. 

The efficiency of quasar targeting is another impor- 
tant criterion. Our spectroscopic follow-up (see § 13. ip 
does not allow us to quantify this because we observed 
the best candidates first and our follow-up is far from 
complete. In principle the efficiency could be computed 
if we knew the number densities of the contaminants on 
the sky. However, the number density of faint stars in 
the relevant regions of color-space is not well constrained 
and is a strong function of Galactic latitude. Further- 
more, faint red unresolved galaxies may also be a signifi- 
cant contaminant and estimating their number density is 
challenging. Instead, we obtain a rough estimate of the 
efficiency of our photometric quasar selection by com- 
paring the number density of quasars predicted from the 
luminosity function, to the total number of sources iden- 
tified in our catalog. 

Specifically, we use the luminosity function (see discus- 
sion in § I2.3P to compute the number counts of quasars 
in redshift bins, and multiply by the completeness de- 
termined from our simulations (see Fig. O, to determine 
the total number of quasars expected from our selection 
as a function of redshift, Nqso(z). To mimic the ef- 
fect of photometric redshift error we convolve Nqso( z ) 
with a Gaussian with a z = 0.16, the standard deviation 
of our photometric redshifts for z > 2.5 (see Figure [2]). 
Then we can estimate the efficiency e = -/VQso(Zphot)/ 
^candidates (^phot) j which provides a rough estimate of the 
fraction of real quasars in our catalog as a function of 
photometric redshift. 

The efficiency of our photometric catalog deduced in 
this way is shown in Figure [6] We have expanded the 
photometric redshift range of our selection to 2.5 < 
•Zphot < 5.3 for this figure to illustrate the efficiency out- 
side the limits of our survey (2.9 < z < 4.5), and to 
avoid edge effects in the photometric redshift convolu- 
tion. Otherwise the criteria applied are exactly those in 
described in § 12.51 For z < 4, the photometric selection 
efficiency is > 50%, except for dips at the problematic 
redshifts z ~ 2.8 and z ~ 3.4. The efficiency reaches 1.6, 



and is thus unphysically above unity at z p hot ~ 3.8, so we 
caution that our estimates are uncertain by at least this 
factor. For z p hot ^ 4.2, the efficiency drops precipitously, 
reaching a minimum of ~ 1% at z ~ 4.4. The contam- 
inant at these redshifts is a rare population of red stars 
which have colors distinct from the stellar locus defined 
in 12.4.21 (see Figure |4|) , but which are significantly more 
abundant than high redshift quasars. Because these red 
stars occur in much lower numbers than the stars on the 
stellar locus, they were not included in our parametriza- 
tion in § |2~! 

To improve the efficiency of our binary search we com- 
pare candidates to t he SPSS spectrosco pic quasars, the 
FIRST radio survey ([Becker et a l. 1995), and the ph oto- 
metric KDE catalog (see § !2.2llRichards et al.ll2009l ). as 
we describe below. 

2.6. Binary Quasar Selection 

To select binary quasars we first apply the cuts de- 
scribed in £j |2.5l to the SDSS photometric data to identify 
an initial photometric quasar catalog. We then search for 
pairs of objects in this catalog that satisfy the following 
additional criteria: 

I) the angular separation must be A < 120", which 
projects to a comoving (proper) transverse dis- 
tance of R ± = 2.7 hr 1 Mpc (0.96 Mpc) and R±_ = 
3.1 h,- 1 Mpc (0.87 Mpc) at z = 3 and z = 4, re- 
spectively 

II) the quasar candidates must have similar photomet- 
ric redshifts |z p hoti — -Zphot2| < 0.4 (if applied to 
simulated quasar pairs (2.9 < z < 4.5) satisfying 
the criteria from 12.51 only 4% are rejected) 

III) one member of the pair must either be: con- 
firmed by the main SDSS spectroscopic survey to 
be a quasar at z > 2.9, or a FIRST radio source 
dBecker et al.lH99gV. or a member of th e KDE pho- 
tometric catalog (Richards et al. 200§) 



In practice, the KDE completely dominates the auxil- 
iary catalog criterion III); the SDSS spectra and FIRST 
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Fig. 4. — Color-color diagrams illustrating the stellar locus and the quasar locus. The gray shaded region denotes stellar density for the 
~ 14, 000 stars used to fit the stellar locus, with darker regions corresponding to higher density. The light blue curve is our fit to the stellar 
locus. The ~ 5000 z > 2.9 (i < 20.2) quasars in our training set are plotted as redshift color-coded points according to the color-bar at the 
lower right (lower redshift quasars are omitted for clarity). Higher redshift objects are plotted as larger points. The quasar locus is shown 
by the dashed black line, similarly color-coded to in dica te redshift. The magenta square box corresponds to a region exclu ded from SPSS 
quasar selection, which we similarly exclude (see § 12.51 1. because of contamination from white dwarf + M-star pairs (see Richards ct al. 
2001a) 



sources account for only ~ 2% of matches. As we do not 
know the completeness of the KDE catalog to i ~ 21, we 
cannot precisely quantify how III) impacts our overall 
completeness for selecting binaries, but this uncertainty 
has a very small effect, as we discuss in more detail below. 

The completeness of the binary selection as a func- 
tion of redshift is required for the clustering analysis in 



Paper II. For concrctcncss, consider the particular red- 
shift range z = 3.9 — 4.0, where the completeness for 
selecting individual quasars is 77% (Figure [5]) . Without 
imposing criterion III) , the completeness for selecting bi- 
naries would be 53%, the square of the individual quasar 
completeness, but with a 4% reduction due to criterion 
II). We naively guess that the combined KDE+ SDSS 
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ij I2.5I have been applied except criterion K on Xstar> which varies from top to bottom as Xstar — [0,5,10,15,40]. The left panel shows 
completeness determined from ~ 600, 000 simulated quasars with i < 21. The right panel shows the completeness if we restrict to quasars 
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Fig. 6. — Estimated efficiency of the faint phot omet ric quasar 
catalog as a function of redshift. The criteria in S I2.5I have been 
applied, except that the photometric redshift range has been ex- 
panded to 2.5 < Zphot < 5.3. For z < 4, the selection efficiency is 
> 50%, except for dips at the problematic redshifts z ~ 2.8 and 
z ~ 3.4, where quasar colors intersect the stellar locus (see Fig- 
ure The efficiency reaches 1.6, and is thus unphysically above 
unity at Zp^ot ~ 3.8, so we caution that our estimates are uncer- 
tain by at least this factor. For Zp^ot ^ 4.2, the efficiency drops 
precipitously, reaching a minimum of ~ 1% at z ~ 4.4 



+FIRST sample has the same completeness as our pho- 
tometric selection (for individual quasars) in this red- 
shift bin. Then requiring that one member of each pair 
match these auxiliary catalogs implies that we will re- 
cover 1 — (1 — 0.77) 2 = 94% of the binaries. Assuming 
the objects missed by our photometric selection and the 
auxiliary catalogs are uncorrelated, the total resulting 
completeness for binaries is 0.94 x 0.53 or 50%. More 
importantly, we do not need to characterize the KDE cat- 
alog very precisely, since even a ~ 20% reduction in com- 
pleteness (highly unlikely) would only amount to < 10% 
error in characterizing our overall completeness for bi- 
naries. The other sources of statistical and systematic 
error in the clustering measurement in Paper II signifi- 
cantly overwhelm this conservative estimate of the error 
in our completeness. Thus we make the Ansatz that 



Fig. 7. — Completeness for photometrically selecting binary 
quasars as a function of redshift determined by applying the crite- 
ria in §[2]~ 300, 000 simulated binaries with i < 21. 

the auxiliary catalogs (KDE+ SDSS +FIRST) have the 
same completeness as our individual photometric quasar 
selection 16 (see Figured]). Figure [7] shows the final com- 
pleteness of our binary quasar selection as a function of 
redshift with this assumption. 

2.7. Survey Status 

Our sample of 27 z > 2.9 binaries (24 of them new dis- 
coveries) is presented in § |4] Here we discuss the status 
of spectroscopic follow-up observations of binary candi- 
dates in our survey. The redshift and angular distribu- 
tion of our sample, shown in Figure [9j reflects specific 
biases in our follow-up observations. First, we tended 
to observe small separation pairs first, because they are 
more interesting for studies of the intervening IGM and 
contribute more significantly to the quasar clustering sig- 
nal (see Paper II) . Thus our follow-up is significantly less 
complete for larger splittings. As a result we limit our 
clustering analysis in Paper II to 9 < 60", and the status 
of our follow-up observations is quantified only for these 

16 We do not impose criterion J) (see § I2.5|1 in this estimate since 
no photometric redshift cuts were made on the KDE catalog. 
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TABLE 1 

Status of Quasar Pair Candidates with 
A6 < 60" 



Status 




%>hot < 3.5 | 


z phot > 3.5 | 


Total 


Observed 




17 


29 


46 


HIGH 




28 


27 


55 


MEDIUM 




16 


32 


48 


LOW 




156 


14 


170 


Total 


217 


102 


319 


Note. - 




The number 


of quasar pair can- 



didates are listed and photometric redshifts of 
both members were averaged, allowing us to 
split the candidates into two redshift bins about 
2 p hot = 3.5. Binary candidates which were tar- 
geted for follow-up observations are labeled as 
'Observed'. The remaining candidates have a 
priority of HIGH, MEDIUM, or LOW. 
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Fig. 8. — Distribution of photometric redshifts of both members 
of the 340 quasar pair candidates with 8 < 60" selected by our 
survey. 

separations. Of the 27 binaries in our sample, 22 have 
< 60", but only 15 of these satisfy the selection criteria 
laid out in § [5] (the other seven were mostly discovered as 
pair candidates from the KDE catalog). The second no- 
ticeable trend in Figure [9] is that there are very few pairs 
with redshifts z = 3.2 — 3.5 and at z > 4.1. Figure[5]shows 
the distribution of mean photometric redshifts of the 340 
(non-spurious) quasar pair candidates with 9 < 60" se- 
lected by our survey. As expected, the largest number 
of candidates occur at redshifts where the efficiency is 
low in Figure [HI and thus at the corresponding redshifts 
we find fewer binaries in Figure [9j both because fewer 
candidates at these redshifts were targeted for follow-up 
spectroscopy and also the success rate is much lower. 

We ran our binary selection on 8142 deg 2 of SDSS 
imaging, which resulted in 431 pair candidates with 9 < 
60". The SDSS images of all candidate quasar pairs were 
visually inspected to reject bad imaging data. The pri- 
mary contaminants are regions around diffraction spikes 
from bright stars, challenging deblends, and fields con- 
taining a globular cluster where the stellar density is in- 
credibly high. Of the 431 initial pair candidates, 91 were 
spurious, leaving 340 candidate quasar pairs. For some 
of the candidates, one member of the pair was observed 
by the SDSS spectroscopic survey and the spectra were 
examined in these cases to check for mis-identifications; 



there were 21 objects which were not z > 2.9 quasars, 
leaving 319 candidates for follow-up. Color-color dia- 
grams for each candidate were also inspected, and each 
candidate was assigned a rank (HIGH, MEDIUM, LOW) 
for follow-up observations. These rankings are subjec- 
tive, reflecting the likelihood that the candidates were 
quasars based on their colors, the fidelity of the pho- 
tometry (noisy data or suspicious de-blends received a 
lower priority), and if both members were members of 
either the KDE, FIRST, or SDSS spectroscopic catalogs. 
In addition smaller separation pairs were given a higher 
priority. 

The result of a successful spectroscopic follow-up ob- 
servation of a quasar pair candidate falls into one of four 
categories: (1) a quasar-quasar pair at the same redshift 
(2) a projected pair of quasars at different redshifts (3) a 
quasar-non-quasar pair (i.e. either a star or a galaxy) (4) 
a pair of non-quasars. Of the 319 candidates for spectro- 
scopic follow-up, 47 have one member confirmed to be 
a z > 2.9 quasar by SDSS spectroscopy and 9 have one 
member in the FIRST catalog (which was not targeted 
by the SDSS spectroscopic survey). For only 37 of these 
319 candidates are both objects members of the KDE 
catalog. 

The clustering analysis in Paper II splits our binary 
sample into two redshift bins (2.9 < z < 3.5 and 
3.5 < z < 4.5), and we describe the status of our follow- 
up observations similarly in bins of mean photometric 
redshift. Table [T] summarizes the status of all of the 
319 candidates for spectroscopic follow-up. To date, our 
follow-up spectroscopy of binary candidates is about half 
finished, with 46 HIGH priority candidates observed and 
a total of 55 HIGH priority candidates remaining. In- 
cluding the medium priority candidates in the tally im- 
plies we are only one third finished, but our experience 
is that these lower priority candidates have a much lower 
success rate. Of 170 LOW priority targets, 132 have one 
member with z p hot < 3.2, indicating high contamination 
where the quasar and stellar loci cross at z ~ 2.8 (see 
Figures [H [HI and [5]) . We caution that the fraction of 
candidates observed thus far is a poor predictor of the 
number of new binaries among our remaining targets, 
both because we observed the best candidates first and 
because the success rate for the lower priority candidates 
is unknown. 

Indeed our tendency to observe the best candidates 
first is largely responsible for our high efficiency for find- 
ing binary quasars. Out of the 46 candidates observed to 
date with 9 < 60", 16 were quasars at the same redshift: 
15 correspond to binaries published in Table [3] 17 and 
the other wa s the gravitationa l lens SDSS J1400+3134 
at z = 3.32 (Inad a et al.l 12008? ) . The remainder of our 
follow-up observations resulted in six projected quasar 
pairs, 15 quasar- non-quasar pairs, and nine pairs of non- 
quasars. 

3. OBSERVATIONS 

In this section we describe the observations for our 
high-redshift binary survey. In § 13.11 we discuss the 
follow-up spectroscopy of pair candidates and in § 13.21 
we explain our algorithm for estimating quasar systemic 

17 Note that only 15 of the total 27 binaries published in Table|3] 
satsify both 6 < 60" and the criteria in § I2.5l and § 12.61 
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TABLE 2 

)uasar Spectroscopy 



Telescope 


Instrument 


Spectrograph Type 


Spectral Coverage FWHM 


Dates 


Reference 


APO 3.5m 


DIS 


Double 


3800 - 10, 000 


440/270 


Nov. - Dec, 2005 


1 


MMT 


Red Channel 


Single 


5000 - 10, 000 


1000 


Feb. 19-21, 2006 


2 


MMT 


Blue Channel 


Single 


3200 - 8400 


600 


Dec. 13-15, 2006 and Mar. 25, 2007 


3 


Palomar 200-inch 


DBSP 


Double 


3100 - 9300 


900/550 


Sep. 2007 - Apr. 2009 


4 


Mayall 4m 


RC-Spectrograph 


Single 


3600 - 9200 


325 


Feb. 9-11, 2008 and Jun. 7-10, 2008 


5 


Keck II 10m 


ESI 


Echellette 


4000 - 10, 000 


60 


Nov. 2006 - Jul. 2008 


6 


Note. — (1) ? (2) 


? (3) ? (4)IOke & 


: Gunnl 119821) (5) ? 


(6)ISheinis et al.l 1120021) 









rcdshifts. As we elaborate below, optical spectra cover- 
ing rest-frame ultraviolet emission lines yield quasar red- 
shifts with large velocity uncertainties (~ 1000 km s _1 ). 
Thus redshifts estimation must be treated carefully in 
order to distinguish true binary quasars from projected 
pairs. 

3.1. Spectroscopic Observations 

Discovery spectra of our binary candidates were ob- 
tained with the Astrophysical Research Consortium 3.5m 
telescope at the Apache Point Observatory (APO), the 
6.5m Multiple Mirror Telescope (MMT), the Palomar 
200-inch telescope, and the Mayall 4m telescope at Kitt 
Peak National Observatory. For the confirmation obser- 
vations we used low-resolution gratings and we typically 
oriented the slit at the position angle between the two 
quasars so that both members of the pair could be ob- 
served simultaneously. Exposure times varied with the 
telescope and observing conditions, but ranged from 600s 
for the bright candidates on the larger telescopes to 2400s 
for the faintest on the smaller telescopes. The relevant 
details of the confirmation observations are summarized 
in Tabled 

A subset of the confirmed binary quasars was later ob- 
served at higher resolution and signal-to-no ise ratio using 
the Echelle Spectrograph and Imager (ESI, ISheinis et alJ 
2002) on the Keck II 10m telescope on Mauna Kea dur- 
ing a number of runs between November 2006 and July 
2008. The goal of the ESI observations was to mea- 
sure small scale transverse correlations of the Lya for- 
est and C IV metal line a bsorbers to reveal the small 
scale structure of the IGM (|Hennawill2004t lEllison etall 
I2007D. characterize the size of metal-e nriched regions 
(|Eliison et ail 120071 : iMartin et al.ll2009h . and constrain 
the dark energy density of the Universe with the 
Lya fo rest Alcock-Paczyhski test (|Alcock fc Paczvnski 
19791: IMcDonald fc Miralda-Escudel 119991 : iHui et al 
1999* IMcDonald! I2003D . Our binary quasar search 



also uncovered many high-redshift projected pairs of 
quasars (see § 12. 7p , which we also observed with ESI 
to st udy foreground quasar environments in absorp- 
tion jH cnnawi ct al. 2006a; Hennawi fe Prochaskall2007l : 
Prochaska fc Henn awi 2009) and the transverse pr oxim- 
ity effect fe.g. lCrofdl2004lKirkman fc Tvtlerll2008D . 

ESI is an echellette spectrograph with ten orders on the 
CCD, giving continuous spectral coverage from 4000A- 
10000A . The resolving power is nearly constant across 
the orders. Our observations exclusively used the 1.0" 
slit, resulting in a resolution of R ~ 5000 or FWHM~ 
60 km s~ . As the ESI slit is 20" long, the instrument 
was rotated to observe both quasars simultaneously for 



pairs with AO < 15", whereas wider separation pairs 
were observed individually with the position angle set 
to the parallactic angle. The signal-to-noise ratio of the 
resulting spectra vary, but typically peak at S/N ~ 15 — 
20 per pixel between the Lya and C IV emission lines 
(around 7000A) , with lower S/N at bluer and redder 
wavelengths. Total exposure times varied from lhr for 
our brighter targets to 6hr for our faintest, with typical 
exposures around 2hr. 

The low resolution discovery spectra and ESI spec- 
tra were reduced using the LowRedux 18 and ESIRedux 19 
data reduction pipelines, respectively, written in the In- 
teractive Data Language (IDL). 

3.2. Estimating Systemic Redshifts 

Quasar redshifts determined from the rest-frame 
ultraviolet emission lines (redshifted into the opti- 
cal at z > 3) can differ by over one thousand 
kilometers per second from the systemic frame, be- 
cause of outflowing/inflo wing material in the broad 
line r egions of quasars (iGaskelll Il982t iTvtler fc Fan! 
fl99l IVanden Ber k et alJ 120011 : iRichards et alJ l2002bF 
iShen et alll2008bl ). A redshift determined from the nar- 
row forbidden emission lines, such as [O III] A5007, 
or Balmer lines such as Ha A6563 are muc h better 
predictors of systemic redshift (Boroson 2005), but at 
z ^ 3, spectral coverage of these lines would require 
observations in the near- infrared. Instead , we follow 
the approach described in lShen et alJ (|2007| ), and deter- 
mine the r e dshift s of quasars from our optical spectra. 
IShen et all (|2007[ ) measured the correlation between the 
relative shifts of the high-ionization emission lines Si IV, 
C IV, C III, and the shift between these respective lines 
and the system ic frame, as traced (a pproximately) by 
the Mg II line ([Richards et all l2002bh . These correla- 
tions can then be used to estimate the systemic redshift 
and the associated error, from high-ionization line cen- 
ters. As an operational definition, we consider quasar 
pairs with velocity differences of \Av\ < 2000 km s _1 
to be at the same redshift, since this brackets the range 
of velocity differences caused by both peculiar velocities, 
which could be as large as ~ 500 km s _1 if binary quasars 
reside in rich environments, and systemic redshift uncer- 
tainties, which can be larger than ~ 1000 km s _1 . 

Our quasar spectra come from a variety of instruments 
(see Table [5]) and have varying signal-to-noise ratios and 
spectral coverage. The ESI spectra are always used for 
redshift determination when available, otherwise we use 

18 http:/ /www. ucolick.org/~xavier/LowRedux/indcx. html 

19 http:/ /www. ucolick.org/~xavier/ESIRcdux/indcx. html 
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Fig. 9. — Distribution of redshift and proper transverse sepa- 
ration for 27 binary quasars in the SDSS imaging footprint. The 
(black) circles indicate the subset of objects which meet the selec- 
tion criteria in §[2] whereas the (red) squares do not. Previously 
known binaries are indicated by filled symbols. The dotted and 
dashed curves are the proper transverse distance corresponding to 
the fiber collision limit of the SDSS (8 = 55"), and the angular 
separation limit of our binary search (9 = 120"), respectively. 

SDSS spectra or the low signal-to-noise ratio discovery 
spectra. For noisy spectra which can have intrinsic ab- 
sorption features, flux weighted centering often yields er- 
roneous results for emission line centers. For this reason, 
we adop t the more robu s t line-c entering proc edure intro- 
duced in lHennawi et al.l (|2006af ). and used bv lShen et alJ 
(|2007f h For those spectra where prominent UV emis- 
sion lines cannot be centered because of low signal-to- 
noise ratio, limited spectral coverage, or strong intrinsic 
absorption features, the redshift is measured using the 
peak of the Lya emission line measured after smoothing 
with a 1000 km s _1 boxcar. A shift is then applied be- 
tween the Lya redshift and the system i c fram e, which 
was measured on average bv lShen et ail IpOOl . 

4. THE BINARY QUASAR SAMPLE 

We present a sample of 24 new high redshift, 2.9 < 
z < 4.3, binary quasars with (proper) transverse sep- 
arations 10 kpc < Rj_ < 650 kpc. Eight mem- 
bers of this sample are very close binaries with R± < 
100 kpc, and four of these are at z > 3.5. Ta- 
ble [3] lists relevant quantities for a total of 27 bina- 
ries in the SDSS imaging area footprint, including three 
previously known pairs. These are SDSS J 1116+4118 
and SDSSJ 1 546+51 34, which were published in 
iHennawi et all (l2006bD, and SDSSJ 1439-003 3, discov- 
ered serendipitously by ISchneider et ail (|2000f) . Of this 
sample of binaries, 22 of 27 have angular separations 9 < 
55", below the SDSS fiber collision scale. For two of the 
binaries both objects were observed by the SDSS spectro- 
scopic survey: SDSSJ 1016+4040 has angular separation 
9 = 68.2", i.e. larger than the fiber collision limit, and 
SDSSJ 1116+4118 (6 = 1 3.8") was observed by overlap- 
ping spectroscopic plates (|Hennawi et al.ll2b06b[ ). 

Several of the binary quasars listed in Table [3] 
do not meet all of the criteria laid out in § [3J As 
we describe further below, PSS 1315+2924, one of 
the closest systems in our sample, was selected by 
a different method altogether as its faint compan- 
ion is not even detected in the SDSS imaging data. 



The remaining binaries which fail to meet these cri- 
teria are: SDSSJ 0004-0844, SDSSJ 0829+2927, 
SDSSJ 1150+4659, SDSSJ 1159+3426, 

SDSSJ 1248+1957, and SDSSJ 1307 +0422, as well 
as SD SSJ 1439-0033 discovered by ISchneider et~aTl 
(2000). Many of these objects were targeted for 
follow-up because they were pair candidates in the 
KDE photometric catalog. In Paper II, we restrict the 
clustering analysis to the subset of pairs which meet 
the selection criteria in § [5J for which we calibrated the 
completeness of our search. 

The distribution of redshifts and proper transverse sep- 
arations probed by the 27 binary quasars in the SDSS 
footprint is illustrated by the scatter plot in Figure O 
The relative paucity of pairs with 9 > 60" results from 
the incompleteness of our spectroscopic follow-up cam- 
paign. The lack of pairs with z = 3.2 — 3.5 and at z > 4.1 
occurs because photometric quasar selection is inefficient 
at these redshifts (see discussion in § 12. 7|) . 

Spectra of selected members of our binary sample are 
shown in Figures [TO] (2.9 < z < 3.5) and [ft] (z > 3.5). 
All of the spectra in these figures are from ESI with the 
exception of SDSSJ 1116+4118, for which we show the 
SDSS spectra. 

4.1. The Discovery of PSS 1315+2924 

The faint quasar companion to the bright quasar 
PSS 1315+2924 was discovered during a pilo t search for 
proto-clusters associated with z > 4 quasars (jDjorgovskil 
Il999t iDiorgovski et al.l 119991 ) . Two different strategies 
were employed in this search. In the first, i?-band snap- 
shot images were taken of about twenty z ~ 4 quasars 
to moderate depth R < 25 u sing the Low Re solution 
Imaging Spectrograph (LRIS; lOke et all 119951 ) on the 
Keck I telescope. If a companion was found within 
~ 10" of the quasar it was spectroscopically observed 
with LRIS in longslit mode. For the second approach, 
deep imaging was obtained in BRI (R < 26) for a sam- 
ple of about 10 z ~ 4 quasars, again using LRIS. Color- 
selection was used to search for dropout candidates (at 
these redshifts, the continuum drop is dominated by the 
Lya forest, rather than the Lyman break, which is used 
to select galaxies at z ~ 2 — 3.5). These candidates 
were then followed up with multislit spectroscopy using 
LRIS in spectroscopic mode. The faint R ~ 24.1 com- 
panion PSS 1315+2924B (see Table G| was found from 
this color-selection approach, and spectra of both quasars 
were obtained on 14 April 1999 UT with LRIS, and re- 
duced using standard techniques. In addition, this proto- 
cluster survey uncovered about two dozen star-forming 
galaxies at small angular separations 9 < 10" from the 
quasars. A similar survey was also carried out around 
z ~ 5 — 6 quasars, which led to the discovery of a com- 
panion z = 5.02 qu asar 196" away from a high redshift 
quasar at z = 4.96 (|Diorgovski et al.ll2003l ). 

4.2. Contamination by Gravitational Lenses 

It is possible that some of the quasar pairs with im- 
age splittings < 20" in our sample are wide separation 
strong gravitational lenses rather than binary quasars. 
Indeed, several gravitationally lensed quasars with im- 
age splittings 9 > 3" have been discovered in the SDSS 
(llnada et al.ll2003t lOguri et al.ll2004 120051 : llnada et all 
2008). In particular, our binary search recovered one 
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TABLE 3 

High Redshift Binary Quasars in the SDSS Imaging Footprint 



IN cLITIC 




RA 
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R 


SDSS J0004-0844A 
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01 


50 
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11 
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20 
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20 
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.9 


2 
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1450 
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.5 


1 


.024 


1450 


250 


44.1 


320 


SDSS J0959+1033B 


09 


oy 


UU 


.139 


4-1 


99 


25. 


.02 


25. 


.8(5 


21 


.47 


20 


.00 


19.56 


19 


.50 


24. 


,5 




7 




n9n 
uzu 


1500 








SDSS J1016+4040A 


10 


1(5 


01. 


.509 


+40 


40 


52. 


.89 


21. 


2(5 


19 


.42 


19 


.22 


19.10 


18 


.81 


109 


.8 


8 


.1 


2 


.996 


520 


1030 


68.2 


548 


SDSS J1016+4040B 


10 


I F, 
±u 


uo. 


.842 


4-40 


10 


05 


.80 


22. 


12 


20 


.3(5 


20 


.06 


20.10 


19 


.92 


49. 


.7 




Q 
.o 


2 


yoo 


1450 








SDSS J1021+1112A 


10 


21 


16. 


.982 


+ 11 


12 


27. 


55 


25. 


.23 


21 


.99 


20 


.50 


20.35 


20 


.18 


26 


.7 





.3 


3 


.805 


520 


1470 


7.6 


56 


SDSS J1021+1112B 


10 


Z± 


I K 


.468 


4-1 1 


1 9 
1Z 


27. 


.83 


25. 


46 


22 


.3(5 


20 


.62 


20.73 


20 


.26 


58. 


.2 


F, 



.i 

,i 


9 


S9Q 

ozy 


630 








SDSS J1053+5001A 


10 


53 


20 


.150 


+50 


01 


46. 


02 


23. 


.77 


21 


.05 


20 


.72 


20.72 


20 


.64 


18. 


.2 





.4 


3. 


.078 


630 


490 


2.1 


17 


SDSS J1053+5001B 


10 


53 


20 


.041 


+50 


01 


47. 


.81 


23. 


35 


21 


.10 


20 


.81 


20.88 


21 


.02 


19. 


.4 




5 




085 


520 








SDSS J1054+0215A 


10 


54 


34. 


.174 


+02 


15 


51. 


95 


23. 


.03 


20 


.22 


18 


.69 


18.61 


18 


.75 


244. 


.9 


6 


.2 


3. 


.984 


630 


1120 


88.3 


644 


SDSS J1054+0216B 


10 


O'l 


9 Si 

Zo 


.515 


4-09 


I f ; ; 
±o 


16. 


37 


23. 


21 


20 


.58 


19 


.41 


19.10 


18 


.97 


12. 


.9 


r, 
O 


Q 
. 9 


Q 

o. 


yuu 


790 








SDSS J1116+4118A 


11 


1(5 


11. 


.740 


+41 


18 


21 


.51 


20. 


.30 


18 


.49 


18 


.13 


17.93 


17 


.95 


87. 


.0 


6 


.0 


2. 


.982 


520 


1840 


13.8 


111 


SDSS J1116+4118B 


11 


I P, 
±t> 


1 


.689 


4-41 


1 R 


14. 


41 


21. 


29 


19 


.41 


19 


.14 


19.01 


19 


.02 


87. 


.9 




7 


Q 
9 


nnfi 
uuu 


520 








SDSS J1118+0202A 


11 


18 


22 


.764 


+02 


02 


36. 


35 


24. 


.82 


21 


.13 


19 


.81 


19.77 


19 


.8(5 


75. 


.0 





.8 


3. 


.939 


790 





83.8 


613 


SDSS J1118+0201B 


11 




1 7 


.920 


4-09 


U 1 


54. 


.48 


2(5. 


78 


21 


.83 


20 


.62 


20.56 


20 


.22 


27 


.3 


o 


Q 
. 9 


Q 
9 


Q^Q 
yoy 


2000 








SDSS J1150+4659A 


11 


50 


55 


.751 


+46 


59 


41. 


.96 


22. 


43 


20 


.52 


20 


.34 


20.56 


20 


.16 


66. 


.7 


12 


.2 


3. 


.005 


710 


140 


34.9 


280 


SDSS J1150+4659B 


11 


50 


52. 


.353 


+46 


•59 


39. 


.44 


23. 


21 


21 


.17 


20 


.79 


20.91 


20 


.91 


2(5 


.7 


3 


,1 


3. 


.003 


790 








SDSS J1159+3426A 


11 


59 


06. 


.632 


+34 


26 


48. 


.24 


23. 


1(5 


20 


.34 


20 


.00 


19.93 


19 


.87 


39 


.0 





.0 


3. 


.135 


790 


1340 


51.2 


405 


SDSS J1159+3427B 


11 


59 


02 


.974 


+34 


27 


12 


.31 


24. 


35 


21 


.21 


20 


.97 


21.15 


21 


.07 


21 


.9 


5 


.7 


3. 


.154 


790 








SDSS J1248+1957A 


12 


18 


46. 


.012 


4-1 Q 


•57 


16. 


.8(5 


22. 


.55 


20 


.41 


19 


.18 


19.44 


19 


.61 


81 


.7 


12 


I 


3. 


.872 


520 


480 


64.8 


477 


bDbb J 1248+1950-0 


12 


48 


42. 


237 


+19 


56 


39 


.90 


24. 


39 


21 


.75 


20 


.16 


20.01 


20 


.10 


91. 


.8 


1 


,2 


3 


.864 


520 








SDSS J1251+2715A 


12 


51 


23 


.788 


+27 

! Z 1 


15 


24. 


61 


23. 


.14 


21 


.29 


20 


.29 


20.21 


20 


.02 


19. 


.7 


5 


.0 


3 


.660 


1500 


90 


13.9 


105 


bubo J Izol+zYloB 


12 


51 


22 


.746 


+27 


15 


23. 


60 


23. 


22 


21 


.82 


20 


.78 


20.68 


20 


.57 


14. 


.6 


3 


.5 


3. 


.661 


1450 








SDSS J1307+0422A 


13 


07 


•56 


.734 


+04 


22 


15. 


.57 


20. 


22 


18 


.18 


17 


.97 


17.59 


17 


. 13 


140. 


.1 


24. 


.0 


3. 


.021 


710 


490 


8.2 


(5(5 


once Ti on? i r*/ioo"D 
oJJbo J loU / +U4zzr> 


13 


07 


56 


.184 


+04 


22 


15 


50 


21. 


4 1 


19 


. (58 


19 


.25 


19.02 


19 


. 12 


18. 


.3 


9 


.1 


3 


.028 


710 








SDSS J1312+4616A 


13 


12 


40 


.873 


+46 


16 


47. 


.8(5 


25. 


60 


21 


.78 


20 


.18 


19.86 


19 


.46 


49. 


.7 


8 


4 


3 


.971 


1450 


530 


85.7 


625 


oDbo J 1312+4615.B 


13 


12 


45. 


.032 


+46 


15 


33 


85 


25. 


.03 


22 


.17 


20 


.13 


20.25 


20 


.31 


99 


.8 


1 


.7 


3 


.963 


2000 








PSSJ1315+2924At 


13 


1 x, 

XV 




.566 


i on 

+zy 


L 1 


40. 


.7(5 


23. 


.00 


21 


.53 


19 


.71 


19.33 


19 


.35 


113. 


.4 


Q 

o 


A 
. 1 


4.18 


2000 


- 


(5.1 


41 


PSSJ1315+2924Bt 


13 


JLU 




.122 




24 


38. 


.9(5 




- 




- 


~24 


— 




— 


- 








4.18 


2000 








SDSS J1353+4852A 


13 


53 


28. 


.570 


+48 


52 


34. 


.06 


24. 


65 


21 


.71 


20 


.10 


20.40 


20 


.18 


62 


.7 


1 


.9 


3. 


.863 


520 


1130 


37.1 


273 


SDSS J1353+4853B 


13 


■JO 




.953 


4-4R 


■JO 


08. 


.53 


25. 


.23 


21 


.93 


20 


.88 


20.86 


20 


.81 


23. 


.7 


1 


I 


3. 


.845 


520 








SDSS J1404+4005A 


14 


01 


23. 


.046 


+40 


05 


11 


03 


24. 


07 


20 


.90 


19 


.57 


19.29 


19 


.22 


68. 


.5 


1 


.6 


3 


.999 


1450 


110 


47.3 


344 


SDSS J1404+4005B 


14 


OA 
U'l 


1 


.647 


4-40 


o r > 

U'J 


37 


79 


24 


.98 


22 


,17 


20 


.97 


20.50 


20 


.47 


18. 


.8 


I 


j 


3 


.997 


630 








SDSS J1414+3955A 


14 


14 


04. 


.169 


+39 


55 


43. 


.54 


20 


73 


18 


.72 


18 


.57 


18.64 


18 


,58 


264. 


.8 


3 


.1 


3 


.218 


520 


1950 


30.7 


242 


SDSS J1414+3955B 


14 


14 


UU. 


.766 


i on 
Toy 


O •. ) 


36. 


27 


21. 


.39 


21 


.31 


20 


.90 


20.86 


20 


.93 


24. 


.1 




Q 


3. 


.191 


520 








SDSS J1420+2831A 


14 


20 


23. 


.773 


+28 


31 


0(5. 


.57 


25. 


43 


22 


.18 


20 


.30 


19.93 


19 


.63 


60 


.8 


3 


.3 


4. 


.305 


710 


1470 


10.9 


77 


SDSS J1420+2830B 


14 




z o . 


.799 


4_9& 


ou 


55. 


.72 


23. 


.13 


23 


.13 


20 


.73 


20.22 


20 


.08 


65 


.2 


K 
O 




4. 


.279 


790 








SDSS J1439-0033A 


14 


39 


52. 


.579 


-00 


33 


59. 


.08 


25 


.14 


23 


.11 


20 


.79 


20.44 


20 


.22 


53. 


.6 


3 


.2 


4. 


.255 





170 


33.4 


237 


SDSS J1439-0034B 


14 


39 


51. 


.600 


-00 


34 


29. 


.12 


23. 


.11 


23 


.36 


21 


.76 


21.32 


20 


.99 


4. 


.8 


3 


.6 


4. 


.258 











SDSS J1541+2702A 


15 


41 


10 


.367 


+27 


02 


24. 


.8(5 


24. 


.66 


21 


.39 


20 


.18 


20.41 


20 


.58 


35 


.9 


2 


.7 


3. 


.623 


630 


810 


6.4 


48 


SDSS J1541+2702B 


15 


41 


10 


.403 


+27 


02 


31. 


22 


23. 


.47 


21 


.51 


20 


.12 


20.42 


20 


,14 


52. 


.6 


1 


.6 


3. 


.636 


520 








SDSS J1546+5134A 


15 


1(5 


14. 


.229 


+51 


34 


05. 


.00 


20. 


68 


19 


.39 


19 


.09 


18.93 


18 


.81 


11. 


,5 


5 


.4 


2 


.961 


1450 


1240 


42.2 


340 


SDSS J1546+5134B 


15 


46 


10. 


.538 


+51 


34 


29. 


.44 


22. 


17 


20 


.49 


20 


.31 


20.25 


20 


.63 


31. 


.4 


4 


.8 


2. 


.944 


650 








SDSS J1622+0702A 


16 


22 


10. 


.109 


+07 


02 


15. 


.34 


21. 


7(5 


17 


.43 


17 


.03 


16.85 


16 


.82 


849. 


.0 


1 


.7 


3 


.264 


790 


160 


5.8 


45 


SDSS J1622+0702B 


1(5 


22 


09. 


.815 


+07 


02 


11. 


.53 


25. 


.60 


20 


.68 


20 


.1(5 


20.03 


20 


.02 


31 


.7 


1 


.7 


3. 


.262 


630 








SDSS J1626+0904A 


1(5 


2(5 


12 


.067 


+09 


04 


37. 


35 


25. 


.42 


21 


.46 


20 


,50 


20.49 


20 


.37 


23. 


.7 


2 


.4 


3. 


.632 


1450 


450 


52.7 


398 


SDSS J1626+0904B 


1(5 


2(5 


38. 


.623 


+09 


04 


24. 


20 


25. 


.03 


21 


.67 


20 


.74 


20.68 


20 


.95 


18. 


.6 


2 


.8 


3. 


.639 


1450 








SDSS J1630+1152A 


1(5 


30 


55. 


.956 


+11 


52 


29. 


43 


21 


.80 


19 


.33 


18 


.93 


18.71 


18 


.66 


71 


.2 


4 


.4 


3 


.277 


1450 


970 


23.8 


186 


SDSS J1630+1152B 


1(5 


30 


56. 


.727 


+ 11 


52 


50 


.31 


2(5 


36 


20 


.63 


20 


.31 


20.35 


20 


.12 


81 


.7 


8 


.0 


3 


.291 


790 









Note. — Units of right ascension are hours, minutes, and seconds, and units of declination are degrees, arcminutes, and arcseconds. The 
brightest member of the pair in i-band is always labeled 'A' and the fainter member is labeled 'B'. Extinction-corrected SDSS five-band PSF 
photometry is given in the columns u, g, r, i, and z. The minimum stellar locus and quasar locus distances are denoted by the columns labeled 
X 2 t ar and Xphot ' respectively. The redshifts are indicated by the column labeled z, <r z is our estimate of the redshift uncertaintiy in km B , and 

Ad is the velocity difference between the two members of a pair in km s . The column labeled A9 is the angular separation in arcseconds and 
R± is the transverse proper separation in kpc. Note that z denotes the z-band magnitude, whereas z indicates redshift. 

t This quasar pair was selected via a different method during a pilot search for proto-clusters associated with z > 4 quasars (see § 14. H . The 
redshifts quoted are approximate and we did not measure a velocity difference. The fainter quasar PSS 1315+2924B is not detected in the SDSS 
imaging, its quoted r-band magnitude quoted is an estimate from Keck LRIS ij-band imaging. 
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Fig. 10. — Spectra of selected binary quasars in the redshift range 2.9 < z < 3.5. A three pixel boxcar smoothing has been applied to 
all spectra. The red curves indicate the 1— a error array. Object 'A' is always shown in the top panel of each plot. All spectra are from 
ESI with the exception of SDSSJ 1116+4118, for which we show the SDSS spectra. The strong features in the ESI spectra at 7600A are 
atmospheric telluric absorption. 




Fig. 10. — continued. 



of th e high-redshift gra vitational lenses from the sam- 
ple of llnada et all (|2008h . SDSS J1400+3134 at z = 3.32 
with image splitting 9 — 1.7". The largest separation 
lens discovered to date is the z = 2.2 quasar lensed into 
two images separa ted by 22.5" by a foreground cluster 
(|Inada et al.l [2006) . But the lower number density of 
z > 3 quasars make high redshift wide-separation lenses 
correspondingly rarer than at z ~ 2. Using cosmo log- 
ical ray-tracing simulations, iHennawi et al.l (|2007al ) de- 
termined that a quasar sample with the flux limit of our 
search should contain ~ 2 lenses with image splitting 
6 > 10" and z > 3. However, this estima te is extremely 
sensi tive to the assumed value of er 8 ( Li et al.l 120061. 
[2001 which was high (er 8 = 0.95) for the IHennawi et all 



2007ai) study; the expectation for hi gh-redshift len ses 



is therefore lower by a factor of > 5 t|Li et al.1l2007h if 
one adopts the lower v alue of as = 0-80 fav ored by the 
WMAP five-year data (jDunklev et al.ll2008l ). 

A quasar pair can be positively confirmed as a bi- 
nary if the spectr a of the images are vastly different (c.f. 
iGregg et al.lf200l . if only one of the images is radio-lou d 
(an 2 R pair, in the notation of iKochanek et al.lll999f h 
or if the quasars' hosts are detected and they are not 



clearly lensed. The sufficient conditions for a pair to be 
identified as a lens are the presence of more than two 
images in a lensing configuration, the measurement of a 
time delay between images, the detection of a plausible 
deflecto r, or the detection of lensed host galaxy emis sion 
(see e.g. IKochanek et alJl!999t iMortlock et al.)ll999h . 

The sample of binaries in Table [3] contains ten sys- 
tems with 9 < 20" which we consider as potential 
lenses. We have ESI spectra of all but two of these, 
SDSSJ 0004-0844 20 and PSSJ 1315+2924. For all cases 
with ESI spectra, we can convincingly rule out the lens- 
ing hypothesis because of their spectral dissimilarity (see 
Figures [TD] and [TTJ). In addition, SDSSJ 1307+0422 A 
was detected in the FIRST survey with 20cm flux Fa — 
14.3 mjy. Using i-band apparent magnitudes to compute 
a flux ratio implies the flux of the second image would 
be Fb = 3.8 mjy under the lensing hypothesis. This is 
four times brighter than the FIRST flux limit, however, 

20 A spectrum of SDSSJ 0004-0844 was obtained the Magellan 
Echellette (MagE) Spectrograph on the Magellan Clay Telescope. 
These data are not published here, but the different Lya emission 
line profiles also allow us to compellingly argue against gravita- 
tional lensing. 
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Fig. 11. — Same as Figure ITOl but for binary quasars with z > 3.5. 
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Fig. 11. — continued. 




Fig. 11. — continued. 



no radio counterpart is detected 21 . 

For both SDSSJ 0004-0844 (z = 3.00, 6 = 4.4") and 
PSSJ 1315+2924 (z = 4.18, 9 = 6.1") we have obtained 
deep imaging usi ng the Low Res olution Imaging Spec- 
trograph (LRIS; lOke et al.l Il995f ) on the Keck I tele- 
scope. For SDSSJ 0004-0844 our LRIS i?-band image 
reaches an approximate point source depth of R ~ 24, 
and there is no evidence for a lensing galaxy or cluster. 
Our LRIS image of PSSJ 1315+2924 reaches a depth 
of R ~ 26, and there is similarly no evidence for a 
lens. Although the absence of a deflector in images of 
a quasar pair does not strictly speaking confirm the bi- 
nary hypothesis, it makes it very likely. For wide sep- 
aration lenses with A0 > 3", such as SDSSJ1004+4112 
(A6» max = 14. 62", zi fins = 68), Q 0957+561 (A0 = 6.2", 
ziens = 0.36: iWalsh eTail [i"979h . or SDSSJ 1029+2623 
(A6> max = 22.5", zi cns ~ 0.6) where the lens is a bright 
galaxy in a cluster or group, the lens galaxies would have 
r ~ 20 — 21 at the most probable redshifts of z = 0.3 — 0.7 
(|Hennawi et al.l r2007af ). and would thus be easily de- 

21 Although the optical flux ratio could be anomalous relative 
to the radio because of microlensing, an omalies as large as a factor 
of ~ 4 are rare fe.e. lPoolev et al.H2007T) . 



tectable in our LRIS images. Although we cannot com- 
pletely rule out lenses with z > 1, a wide separation 
z > 3 quasar lensed by such a high redshift group or 
cluster is extremely unlikely because structures with ap- 
preciable strong-lensing cross-section are predic ted to be 
extremely rare at z > 1 (jHennawi et al]l2007al fa). Thus 
we are confident that all of the quasar pairs in our sample 
are indeed binary quasars. 

5. SUMMARY AND CONCLUSIONS 

We have conducted the first systematic survey to dis- 
cover high redshift binary quasars. Using color-selection 
and photometric redshift techniques, we mined 8142 deg 2 
of SDSS imaging data for binary quasar candidates, and 
confirmed them spectroscopically with follow-up obser- 
vations on 4m class telescopes. Our sample of 24 new 
binaries at redshifts 2.9 < z < 4.3 with (proper) trans- 
verse separations 10 kpc < R± < 650 kpc increases the 
number of such objects known by an order of magnitude. 
Eight members of this sample are very close pairs with 
R± < 100 kpc, and of these close systems four are at 
z > 3.5. More importantly, whereas the only previously 
known close binary at z ~ 4 was discovered serendip- 
itously ([Schneider et al.ll2000D . we defined an algorithm 
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for homogeneously selecting binary candidates and quan- 
tified its completeness and efficiency using simulations. 

Armed with this new sample of binaries and a well- 
understood selection function, Paper II presents the 
first measurement of the small-scale clustering (R < 
1 h^ 1 Mpc comoving) of high-redshift quasars. On 
large scales, high-redshift quasars cluster much more 
stron gly than their low-redshift counterparts (|Shen et al.l 
12007ft . indicating that quasars are hosted by extremely 
massive dark matter halos (M > 10 13 M ) - 
perhaps not so surprising as these quasars likely har- 
bor the most extrem e black holes (M > 10 9 M©) 
in the Universe (e.g. IShen et ail [2008b). Small-scale 
clustering measurements characterize the 10 kpc — 
1 Mpc scale environments of these extreme black holes, 
providing im portant clues about how quasar activity 
is triggered jThacker et al l 120061; [Hopkins et all 120081 : 
IWetzel et all l2009bl iThacker et al.l l2008|) and constrain- 
ing how quasars populate their dark matter halos 
(IWetzel et al.ll2009bl : IShankar et alJ [20091. Because our 
survey is only about half complete, we can only place 
lower limits on the small-scale clustering. Nevertheless, 
even these lower limits provide important constraints on 
models (see Paper II). 

The massive dark halos inhabited by high-redshift 
quasars (jShen et alJ l2008b[ ) suggests that high-redshift 
binaries represent exponentially rare conjunction of two 
extremely massive black holes (and possibly two dark 
matter halos) . Furthermore, the simultaneity of their lu- 
minous quasar phases is another important coincidence. 
Using the luminosity function and the number of bina- 
ries detected in our survey, we can estimate the frac- 
tion of quasars which have an active companion. In 
the redshift range 3.5 < z < 4.5, there are -/Vqso(« < 
21) = 1.9 deg~ 2 , or about - 15,000 in the SDSS area 
we surveyed. We uncovered four binary quasars with 
R < 100 kpc (9 < 15") and i < 21, which given our 
incompleteness (see Figure [7]), implies ~ 8 binaries actu- 
ally exist. Thus the companion probability is extremely 
small ~ 0.05%. Equivalently, over this redshift range 
our survey volume is ~ 100 Gpc 3 (comoving), imply- 
ing a comoving number density nonary ~ 10 -10 Mpc - , 
or about one binary per 10 Gpc 3 . For comparison, this 
is an order of magnitude rare r than the extremely rare 
z ~ 6 quasars (jFan et al.ll2001h . and corresponds to a vol- 
ume thirty times larg er than the Millennium Simulation 
(|St>ringel et al.ll2005h . Despite their paucity, the hand- 
ful of binaries presented he re amount to a substantial 
small-scale clustering signal (S hen et al.ll2009f ): although 
the observed pair fraction is incredibly low, the proba- 
bility of such associations occurring at random is orders 
of magnitude lower. 

Binaries are scarce in our survey partly because at 
z = 4, i < 21 corresponds to an extremely luminous (and 
therefore rare) quasar, Mi < — 26. 5 22 , i.e. comparable to 
3C 273 and about ~ 3.5 magnitudes brighter than the 
canonical distinction (Mb < —23) between quasars and 
Seyferts. Upcoming deep synoptic surveys such as the 
Panoramic Sur vey Telescope fc R apid Response System 
(Pan-STARRS iKaiser et all I2002D and the Large Syn- 



optic Survey Telescope (LSST llvezic et al.l 12008). will 
combine variability and color-selection to identify large 
photometric samples of faint quasars with high complete- 
ness and efficiency. The number of binaries uncovered by 
these surveys scales as N oc Nq SO A, where Aqso is the 
quasar number counts and A is the survey area. Extrapo- 
lating our luminosity function to i < 24.5 (i.e. M, < —23 
at z = 4), we find N QSO {i < 24.5) ~ 30 deg" 2 for 
3.5 < z < 4.5, or about 15 times the number density of 
the SDSS in this range. The LSST will discover all active 
galaxies over 20,000 deg -2 even beyond this depth, im- 
plying a factor of ~ 600 increase in the number of quasar 
pairs. If we naively assume clustering is independent of 
luminosity 23 and extrapolate from the size of our sam- 
ple, we predict ~ 5000 binary quasars with R < 100 kpc 
and 3.5 < z < 4.5 in the LSST imaging. The shallower 
(i < 24) but wider (A = 3tt) Pan-STARRSl survey will 
contain a comparable number. The clustering measure- 
ment errors scale as the square-root of the number of 
binaries, impl ying errors abo ut 25 times smaller than 
achieved in (jShen et al.l l2009h from our sample. Fur- 
thermore, the fainter sample and dramatically improved 
statistics will allow a precise investigation of the redshift 
and luminosity dependence of both small and large scale 
clustering, resulting in powerful constraints on clustering 
models. 

Searching for galaxy overdensities around the binaries 
published here will shed light on their environments and 
the mechanism triggering their quasar activity. Although 
the separations of our binary quasars are too large for di- 
rect gravitational interactions to trigger accretion, they 
could reside in the same overdense environments whereby 
interactions wit h other galaxies make quasar activit y 
more likely (e.g. lDjorgovsk1ll99H iHopkins et alJ 12008). 
The idea of using quasars as beacons to pin point dense 
regions in the early Universe is not new (e.g. iDiorgovskil 
1999; iDiorgovski et al.|[l999h . Deep imaging studies of 
high-redshift quasar environments in search of overden- 
sities and proto-clusters have been conducte d at z ~ 3 
(e.g. iSteidel et alJ fl999l llnfante et all 120031) . bv Djor- 
govski and collaborators at z ~ 4 (jDjorgovskil 119991 : 
IDiorgovski et al. I1999L l2~003h. and by several 
at z ~ 5 



t al. IliJijiJl I^UUdl). and by several groups 
6. ( Stiavelli et all 120051 iZheng et al.l [20061: 



Kashikawa et all 120071; iKim et alJ 120081 : iPriddev et all 
2008h . While it is still unclear whether high-redshift 



quasars reside in proto-clusters, compelling evidence 
for overdensities has been found around a handful of 
z > 4 radio galaxies (e.g IVenemans et all I2002L 120041 
Miley et al.l 120041; lOverzier et all 120061 : IVenemans et al.1 



20071; lOverzier et al.ll2008D . If similar deep imaging and 



22 Following Richards ct al. (2006) we quote i-band absolute 
magnitudes K-corrected to z = 2. 



multi-object spectroscopy of our high redshift binaries 
uncovers proto-cluster environments, this would estab- 
lish that either the binaries are the highest sigma peaks 
in the early Universe, or that enhanced quasar activity is 
triggered by galaxy interactions. Furthermore, the kine- 
matics and size of such an overde nsity could dist i nguish 
these alternatives. Intriguingly, iFukugita et alJ (|2004l ) 
obtained deep Subaru images of the z = 4.25 quasar pair 
discovered by ISchneider et al.l (j2000) and found no ev- 

23 The luminosity dependence of large-scale quasar clustering at 
high-redshift has yet to be quantified, although there is tentative 
evidence that the brightest quasars cluster more strongly at z < 2 
(Shen ct al. 2001) . 
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idence for an overdensity of galaxies. But the tenfold 
increase in high-redshift binaries published here provides 
ample opportunity for future detailed statistical studies. 

We conclude with the reminder that our survey for 
quasar pairs is ongoing and less than 50% complete. Al- 
though the binary yield will likely drop because we ob- 
served the best candidates first, we anticipate discovering 
10 — 20 more high-redshift binaries in the SDSS imaging. 
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